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1. Introduction 



Physics, chemistry and biology are all sciences describing different aspects of the same 
world, yet their practice differs enormously... why is that? To a large extent the cultural 
differences between the fields can be explained in terms of fundamental difference between 
the physical systems being analyzed: A biological system typically has many unrelated 
energy scales of comparable magnitude, such as the low lying excitation spectra of various 
different enzymes collaborating in a biological process. While it is possible to crudely 
explain why the body temperature of a human is O(10 2 ) K in terms the fine structure 
constant a and the masses of the proton and the electron, you will not be able to explain 
why a human is healthy with a temperature of 310.5 K but not with a temperature of 
320.5 K without a thorough understanding of human physiology. 

In contrast, a physics problem tends to involve energy scales that are widely separated, 
which allows one — with care — to determine many of the properties of a system using 
the tool of dimensional analysis. To see how this works, we first choose physical units with 
the speed of light and Planck's constant set to unity: 



Then all physical parameters can be said to have the dimension of mass to some power. In 
particular, if some quantity X has dimensions [mass] n , we will just say "X has dimension 
n" or [X] = n. You should be able to convince yourself that 



h=c= 1 . 



[volume] ~ [ / d 3 



x] 



= -3 



[Gn] ~ [Gf] 



= -2 



[length] ~ [time] ~ \pn] 



= -1 



[velocity] ~ [a] = 



[energy] ~ [momentum] ~ [Aqcd] ~ [d/dx] ~ [d/dt] = 1 



[E] ~ [B] = 2 
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and so forth 0. Of particular interest to us will be the dimension of a Lagrange density; 
since f d 4 xC is an action — which comes in units of h and is dimensionless — it follows 
that 

[C}=4. 

Let us use dimensional analysis to discuss properties of the hydrogen atom. To a 
first approximation, the system is described in terms of one dimensionful parameter, the 
electron mass m e , and one dimensionless number, the fine structure constant a = 1/137. 
If we want to estimate the size ao of a hydrogen atom, since length has dimension [mass] -1 
it follows that ao oc m" 1 , where the proportionality constant is dimensionless. What is 
it? We would guess it is some number of order unity, times some power of a. Alas, 
dimensional analysis doesn't tell us the power... we have to look at the dynamics to realize 
that the appropriate power is a -1 . We arrive at ao — l/(am e ), which in fact is the exact 
expression for the Bohr radius. What about the ground state binding energy Eq of the 
hydrogen atom? Eq has the dimensions of [mass], and dynamics gives us a proportionality 
factor of a 2 , so we estimate Eq ~ ct 2 m e , which is in fact only a factor of 2 off from the 
correct value. 

We can go on and ask what wavelength of photon allows one to examine crystal 
structure by means of diffraction. Since atomic sizes are given by ao = l/(am e ), the atomic 
spacing in a crystal is expected to be similar. It follows that to see crystal structure, we 
need photons with wavelength A < ao, or equivalently energy E 1 > am e = O(10 KeV) - 
visible light won't do, we need X-rays. On the other hand, if we wish to estimate the energy 
of light emitted from an atomic transition in hydrogen, get E 1 < Eq ~ a 2 m e = O(10 eV), 
corresponding to a wavelength A 7 ~ clq/cy, several orders of magnitude larger than the 
atom itself. 

The above analysis may seem familiar and unimpressive — after all, while it is nice 
that one can easily determine how m e enters quantities of physical interest, one also has 
to keep track of powers of a which involves going back and examining the Schrodinger 

1 For practical purposes it is conventional to express everything in units of energy (eV) rather 
than mass (gm). Thus m p ~ 940 GeV, m e ~ .511 MeV, 1 F = 10~ 13 cm ~ (200MeV) _1 , 
IK ~ !0- 4 eV, etc. 
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equation for hydrogen (which we know how to solve anyway). Yet there is something 

remarkable about the analysis, and it lies in the sentence: 

"To a first approximation, the system is described in terms of one dimensionful param- 
eter, the electron mass m e , and one dimensionless number, the fine structure constant 
a = 1/137." 

Why should this be true, and what is the approximation we are making? Why is the 
system insensitive to the proton mass? Or the W and Z boson masses? Or Newton's 
constant Gat? Why don't we need to take into account the bottom quark mass, mj, — 5 
GeV? The ratio r = mi jm\ = 10 8 is a dimensionless number; couldn't the ground state 
energy of the hydrogen atom be some function of r, namely Eq = /(r)ct 2 m e , where f(r) 
could as easily equal 10 8 as 10 -8 ? 

The technique of constructing effective theories allows one to answer such questionsi. 
The basic idea is not to attempt to construct "a theory of everything" , but to construct 
an effective theory that is appropriate to the energy scale of the experiments one is in- 
terested in. A theory of everything is beyond our abilities to construct since we cannot 
probe everything experimentally, and even if we could, it would contain lots of information 
extraneous to any particular experiment. 

Effective field theory techniques get interesting when we wish to look at effects over a 
wide range of energies: then we must understand how effective theories at different scales 
are related to each other. This is useful if one wishes to relate experiments over a large 
range of energy scales, or if one has a theory of high energy physics and wishes to predict 
the results of low energy experiments. In the example of the hydrogen atom and the b 
quark, one can show that Eq depends on the b quark mass in the following way: 

E = \a 2 m e (1 + 0{ml/m 2 b )) . 

There is a small power law correction to the naive value oc (ml/ml) ~ 10 -8 , as well 
as a hidden dependence of a on the b quark mass. When one is only concerned with 

2 The concept of effective field theory is mainly associated with Ken Wilson, although it is an 
edifice with many architects. For two quite dissimilar modern treatments see the reviews by J. 
Polchinski g] and H. Georgi [|. 
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atomic physics, one can ignore the m& dependence of a, since it is already incorporated 
in the measured physical value a = 1/137. Effective field theories however allow you to 
simply compute electromagnetic scattering of electrons at a center-of-mass energy ~ 100 
GeV, where one finds that the appropriate value of for the fine structure "constant" is 
a ~ 1/128 — a change due in part to the effects of the b quark. 
The outline of these lectures is as follows: 

1. First I discuss how to construct effective theories as an expansion in operators consis- 
tent with low energy symmetries, and how to use dimensional analysis to extract the 
interesting physics; 

2. Next I explain how the dimension of the operator determines whether it is irrelevant, 
relevant or "marginal" to low energy physics; 

3. I then consider "matching" : how one relates the parameters of a low energy theory to 
those of a higher energy theory; 

4. I then show how quantum corrections can sometimes change the dimension of an 
operator and therefore radically change low energy physics. 

5. Finally I mention the application of some of these ideas to the strong interactions. 
Each section is followed by some exercises (of widely varying difficulty); I encourage you 
to work through them. 



Exercise 1. Estimate the energy scale of rotational excitations of water in terms of m p , 
m e and a. Does your answer explain why microwaves are used to heat food? 



2. Dimensional analysis, symmetries, and the separation of scales 

The basic idea behind effective field theories is that a physical process typified by some 
energy E can be described in terms of an expansion in E/Ai, where the are various 
physical scales involved in the problem which have dimension 1 and which are bigger than 
E. In this section we show how this simple idea can be incorporated into a predictive 
framework. 

4 



2.1. Example 1: Why the sky is blue. 

Consider the question of why the sky is blue. More precisely, consider the problem 
of low energy light scattering from neutral atoms in their ground state, where by "low 
energy" I mean that the photon energy E 1 is much smaller than the excitation energy AE 
of the atom, which is of course much smaller than its inverse size or mass: 

E 1 < AE < ciq 1 < M atom . 

Thus the process is necessarily elastic scattering, and to a good approximation we can 
ignore that the atom recoils, treating it as infinitely heavy. Let's construct an "effective 
Lagrangian" to describe this process. This means that we are going to write down a La- 
grangian with all interactions describing elastic photon-atom scattering that are allowed by 
the symmetries of the world — namely Lorentz invariance and gauge invariance. Photons 
are described by a field which creates and destroys photons; a gauge invariant object 
constructed from is the field strength tensor = d^A v — d v A il . The atomic field is 
defined as <f> v , where 4> v destroys an atom with four- velocity (satisfying v^v^ = 1, with 
= (1, 0, 0, 0) in the rest-frame of the atom), while <j>l creates an atom with four- velocity 
u M . So what is the most general form for £ e //? Since the atom is electrically neutral, 
gauge invariance implies that (j) can only be coupled to F^ v and not directly to A^. So 
C e ff is comprised of all local, Hermitian monomials in (f)' v (f) v , F^jV^, and <9 M . Certain 
combinations we needn't consider for the problem at hand — for example d^F^ = for 
radiation (by Maxwell's equations); also, if we define the energy of the atom at rest in it's 
ground state to be zero, then v^d^4> = 0, since = (1,0,0,0) in the rest frame, where 
d t (f) = 0. Similarly, d^d^cp = 0. Thus we are led to consider the Lagrangian 

C eff = ckPI^F^F^ + c 2 fy v v a F atx v^ 

(2-1) 

+ c 3 ^„(A)V F + ... 
The above expression involves an infinite number of operators and an infinite number 
of unknown coefficients! Nevertheless, dimensional analysis allows us to identify the leading 
contribution to low energy scattering of light by neutral atoms. It is straightforward to 
figure out that 

[d,] = 1 , [F, v ] = 2, [0] = § • 
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The first follows from the fact that <9 M has the dimension of 1 /length. The second is easily 
determined by noting that the Maxwell Lagrangian is Cm = —\F 2 -, and that [C] = 4. 
Finally (ft is determined by writing a state with no atom as |0), and one atom as \A), 
where <fy (x) |0) = ^a(x) \A), with ^a(x) being the normalized atomic wavefunction and 
(0| 0) = (A\ A) = 1. Since / d 3 x \^ A \ 2 = 1, it follows that [(ft) = 3/2. 

Since the effective Lagrangian has dimension 4, the coefficients ci, c 2 etc. also have 
dimensions. It is easy to see that they all have negative mass dimensions: 

[ci] = [c 2 ] = -3 , [c 3 ] = -4 

and that operators involving higher powers of d ■ v would have coefficients of even more 
negative dimension. It is crucial to note that these dimensions must be made from di- 
mensionful parameters describing the atomic system — namely its size ro and the energy 
gap 5E between the ground state and the excited states. The other dimensionful quantity, 
Ej, is explicitly represented by the derivatives 9 M acting on the photon field. Thus for 
E 1 <C AE, Tq 1 the dominant effect is going to be from the operator in £ e // which has the 
lowest dimension. There are in fact two leading operators, the first two in eq. ( [2.1| ), both 
of dimension 7. Thus low energy scattering is dominated by these two operators, and we 
need only compute c\ and ci- 

What are the sizes of the coefficients? To do a careful analysis one needs to go back 
to the full Hamiltonian for the atom in question interacting with light, and "match" the 
full theory to the effective theory. We will discuss this process of matching later, but for 
now we will just estimate the sizes of the Cj coefficients. We first note that extremely low 
energy photons cannot probe the internal structure of the atom, and so the cross-section 
ought to be classical, only depending on the size of the scatterer. Since such low energy 
scattering can be described entirely in terms of the coefficients c\ and C2, we conclude that 

a ~ c 2 ~ To • 

The effective Lagrangian for low energy scattering of light is therefore 

£eff = 4 (a 1( ftt(ft v F^F^ + a 2 (ftl(ft v v a F a ^pF^) (2.2) 
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where a\ and a% are dimensionless, and expected to be 0(1). The cross-section (which 
goes as the amplitude squared) must therefore be proportional to r®. But a cross section 
a has dimensions of area, or [a] = —2, while [r®] = —6. Therefore the cross section must 
be proportional to 

o oc Ey , (2.3) 

growing like the fourth power of the photon energy. Thus blue light is scattered more 
strongly than red, and the sky looks blue. 

Is the expression (^73|) valid for arbitrarily high energy? No, because we left out terms 
in the effective Lagrangian we used. To understand the size of corrections to (|2.3| ) we 
need to know the size of the C3 operator (and the rest we ignored). Since [03] = —4, 
we expect the effect of the C3 operator on the scattering amplitude to be smaller than 
the leading effects by a factor of E 7 /A, where A is some energy scale. But does A equal 
M a torrn f^ 1 ~ am e or AE ~ a 2 m e ? The latter is the smallest scale and hence the most 
important. We expect our approximations to break down as E^ — > AE since for such 
energies the photon can excite the atom. Hence we predict 

aoc£% 6 (l + 0(£ 7 /A£)). (2.4) 

The Rayleigh scattering formula ought to work pretty well for blue light, but not very far 



into the ultraviolet. Note that eq. fl2.4T) contains a lot of physics even though we did very 



little work. More work is needed to compute the constant of proportionality. 

2.2. Example 2: The binding energy of charmonium to nuclei. 

Closely related to the above example is the calculation of the binding energy of charmo- 
nium (a cc bound state, where c is the charm quark) to nuclei. In the limit that the charm 
quark mass m c is very heavy, the charmonium meson can be thought of as a Coulomb bound 
state, with size ~ a s (m c )m c , where a s (m c ) is a small number (more on this later). When 
inserted in a nucleus, it will interact with the nucleons by exchanging gluons with nearby 
quarks. Typical momenta for gluons in a nucleus is set by the QCD scale Aqcd — 200 
MeV. For large m c then, the wavelength of gluons will be much larger than the size of the 
charmonium meson, and so the relevant interaction is the gluon-charmonium analogue of 



¥i ¥3 ¥1 ¥3 
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Fig. 1. (a) Tree level W and Z exchange between four fermions. (b) 
The effective vertex in the low energy effective theory (Fermi interac- 
tion). 



photon-atom scattering considered above. The effective Lagrangian is just given by (|2.2|), 
where <p now destroys charmonium mesons, and F^ v is replaced by G®, the field strength 
for gluons of type a = 1, ... ,8. The coefficients 01,2 may be computed from QCD. To 
compute the binding energy of charmonium we need to compute the matrix element 

(N,cc\ [ d 3 x(P t (j)G a ^G a ^ \N,cc) 



(as well as the matrix element of the other operator in (|2.2| )), which we do not know 
how to do precisely since the system is strongly interacting. We can estimate its size by 
dimensional analysis though, getting 

A 4 

rn 3.4 Il QCD 

E B ~ r A QCD * — y . 
This problem is discussed with greater sophistication in ref. 0. 

2.3. Example 3: The cross section for low energy neutrino interactions. 

The term "weak interactions" refers in general to any interaction mediated by the W 
or Z bosons, whose masses are 80 GeV and 91 GeV respectively. Since their couplings are 
rather weak, it is usually a decent approximation to only consider first order perturbation 
theory, namely Feynman graphs fig. 1(a). 

These interactions describe 2 — > 2 scattering of fermions, or 1 — > 3 decays. The W 
and Z propagators (in a particular choice of gauge) are given by —ig^vlif^ — M 2 ), where 
q is the four-momentum transferred. For low energy processes, q 2 M 2 and one never 
has enough energy to make a physical W or Z, so there is no reason to include them in 



the theory. Thus the low energy effective theory just has the contact interactions shown 
in figure lb: 

~ G<iPiip 2 ip3ip4, (2.5) 

where ipi represent fermion fields (for either quarks or leptons). Since the Lagrangian for 
a noninteracting fermion is £/ = ifj(i0 — m)ip, it follows that 

M = i > 



and so the coupling G in eq.(|2.5|) has dimension [G] = —2. You can estimate its size by 



equating the processes fig la and fig. lb and it is roughly given by g 2 /M 2 , where g and 
M are the dimensionless coupling constant and mass of the W or Z. (This is "matching" , 
and you will do this more precisely in a later exercise). 

Since neutrinos only interact through the weak force, it follows that low energy neu- 
trinos (E u <C M\v) interact with matter through an operator of the form ( |2.5| ), where two 
of the ip's are neutrino fields, and the other two are either quark or lepton fields. Thus the 
neutrino cross-section a, which has dimension -2, must be proportional to G 2 which has 
dimension -4. Therefore the cross-section must scale with energy as 

a v ~ G 2 s (2.6) 

for low energy neutrinos, where s equals the square of the total energy in the center of 
momentum frame. 



Exercise 2. Use the effective Lagrangian to explain why the force between to static 
neutral atoms at a separation R ^> ao scales like 1/R 7 . You should be able to get this 
from dimensional analysis of the two photon exchange process. Can you explain why there 
isn't any contribution from one photon exchange due to the operator (f>1idn<f) v v V F^ V ? Can 
you explain why the approximations made in the effective field theory are expected to 
be invalid for R < ao/a? For a detailed discussion of why one finds 1/R 7 instead of the 
nonrelativistic result 1/R 6 , see ref. ||/. 
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Exercise 3. The \x and the r have the same weak interactions, and so the amplitudes for 
decay via W exchange fx — > e~V e h>^ and r — > eV e y T are equal. Since the r is heavier, it has 
more ways to decay than the \i. The mass and lifetimes of the two particles are 

m M = 106 MeV , T M = 2.2 x 10" 6 sec, 

m T = 1777 MeV , T T = 3.0 x 10" 13 sec. 

Given that the ii decays 100% of the time via \i — > eV e i>^, calculate the fraction of r decays 
which are of the form r — > ev^.v^. All you need to know is that [G] = —2 in eq. ( |2.5|) . How 
does your answer compare with the observed branching ratio BR T ^ e j; cVT = 18.01 ±0.18%? 



Exercise 4. The partial mean lifetime of the proton in the decay p — > e + n is known to be 
greater than 1.3 x 10 32 years. Suppose that new physics at a scale A does give rise to this 
decay (for example, through the tree level exchange of a particle with mass A, analogous 
to the interaction in fig. 1). What is an approximate lower bound on A ? (Hint: Find the 
lowest dimension operators made up of quark and lepton fields that could give rise to this 
decay mode). 

Exercise 5. Suppose that there are AB = 2 baryon violating operators due to new physics 
at a scale A, but no AB = 1 operators, so that the proton is stable, but n — n oscillations 
can occur. Such oscillations have not been seen, and the lower bound on the oscillation 
rate is 1.2 x 10 8 sec. How does this translate into a bound on the scale A? 

Exercise 6. Estimate the cross-section for photon-photon scattering at energies well below 
the electron mass, E 1 m e . Since ct = 1/137, counting powers of ct matters! 



3. The relevant, the irrelevant, and the marginal 

So far I have only discussed examples where the operator has dimension greater than 

4, so that the coefficient has negative dimension and the resulting cross-section or decay 
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width therefore becomes smaller as the energy scale E of the interaction gets smaller. Even 
though these are often the most interesting interactions — since they are harbingers of 
new physics at energies well above E — these sorts of interactions are called irrelevant. 
The rationale is that at low energies, their effects are small (for example, see eqs. ( |2.4|) , 
( |2.6|) .). In contrast, operators with dimension less than 4, whose coefficients have posi- 
tive dimension, are called relevant operators because they become more relevant at lower 
E. Ignoring quantum corrections, the only relevant operators one can write down in a 
relativistic field theory in four dimensions are 

• The unit operator (whose coefficient is the cosmological constant) which is dimension 
0; 

• Boson mass terms, which are dimension 2; 

• Fermion mass terms, which are dimension 3; 

• 3-scalar (^> 3 ) interactions, also dimension 3. 

(Terms linear in a scalar field can be removed by shifting its value). 

An example is the electron mass, arising from the dimension 3 operator tfjtfj with 
coefficient m e . In high energy scattering (E e ^> m e ) the effects of the electron mass 
are negligible. However, the effects of the electron mass are very important at energies 
comparable to m e . In fact, exercise 3. is only simple if one not only assumes that the 
momentum scales in \i and r decay are low compared to Myy, but also that they are high 
compared to m e , so that one could ignore the electron mass. As another example, consider 
two real scalar fields <p an d 3? with a Lagrangian of the form 

£ = i(c^) 2 + I(d$) 2 - \m 2 4> 2 - iM 2 $ 2 - i/^ 2 $ . (3.1) 

We will assume 

m ~ k <C M . 

We can see that unlike fermion fields, scalar fields have dimension 1, which means that the 
coupling k does as well: 

[0] = [$] = [«] = 1 . 

By our definition above, the three scalar interaction is relevant. Consider cp<p —>■ 4>4> scat- 
tering at tree level in this model. First take the case where the center of mass energy E^ 
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is much greater than m, M, and k. Then the scattering amplitude from a graph like fig. 
la — with the ipi replaced by cj> and the W, Z replaced by $ — is proportional to k 2 and 
the cross section must go as 

I / K n4 1 

020—20 \E 4 ,^>vn,M,K OC { — ) 

which goes rapidly to zero for large E^. Now look at the scattering cross section at an 
energy satisfying m <C E^ <C k, M, so that the particles are still relativistic, but the 
$ propagator and be contracted to a point as in figure lb. Now the cross section goes as 

i / K u 1 

020—20 1 m<^<K,M OC (— J 

Contrasting this low energy cross section with that for neutrinos in §2.3 explains why k 
interaction is said to be relevant at low energies, while Fermi interaction is called irrelevant. 

Operators with dimension 4 lie between relevancy and irrelevancy and are called 
marginal. Examples of marginal interactions are 

• (j) 4 interactions; 

• Yukawa interactions 

• Gauge interactions (interactions of a gauge boson with itself, a scalar, or a fermion). 

As we will see, marginality is an insecure position to be in, and quantum corrections will 
almost always change such operators from marginal to either relevant or irrelevant. 

In each of the examples in the previous section we focussed on irrelevant interactions. 
The only reason why this was interesting was that in each case, irrelevant operators gave 
the leading contribution to the process... and because they weren't too irrelevant. For 
example, neutrinos only interact with matter through irrelevant operators... so if one sees 
any evidence of low energy neutrino scattering, one is seeing irrelevant operators. In con- 
trast, e + e~ scattering has an electromagnetic contribution from photon exchange. Since 
the photon-electron coupling is a marginal operator, at low energies electromagnetic inter- 
actions dominate the weak interaction contribution. (No coincidence that these are called 
weak interactions!). Now imagine a world where the W and Z masses were 10 16 GeV. 
In this world there would be practically no discernible weak interaction effects. The neu- 
tron would have a lifetime greater than 10 30 years, and there would be no radioactivity; 
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no one would have guessed that the neutrino existed, because it would not interact with 
anything. All we would discern in particle collisions and spectra would be the strong and 
electromagnetic interactions. 

In fact, in any situation where there is a large gap between the energy where one is 
doing experiments and the energy scale of new physics, the effective theory one constructs 
will only consist of marginal and relevant operators... such theories are called "renormal- 
izable" and are a natural outcome when there is a large hierarchy of physical scales. This 
typically results in a vast simplification of the physics one needs to consider, as seen in the 
next example. 

3.1. Example: the success of Landau liquid theory. 

A condensed matter system can be a very complicated environment; there may be 
various types of ions arranged in some crystalline array, where each ion has a complicated 
electron shell structure and interactions with neighboring ions that allow electrons to 
wander around the lattice. Nevertheless, the low energy excitation spectrum for many 
diverse systems can be described pretty well as a "Landau liquid", whose excitations are 
fermions with some complicated dispersion relation but no interactions. Why this is the 
case can be simply understood in terms of effective field theories, modifying the dimension 
counting used above to suit a nonrelativistic system with a Fermi surface i. 

Let us assume that the low energy spectrum of the condensed matter system has 
fermionic excitations with arbitrary interactions above a Fermi surface characterized by 
the fermi energy ep ; call them "quasi-particles" . Ignoring interactions, the action can be 
written as 



where an arbitrary dispersion relation e(p) has been assumed. Now let us consider higher 
dimension operators... but how should we count "dimension"? In the relativistic case, we 
defined mass dimension in a simple way, since we wanted to do an expansion in E/A, 




ree — 




(3.2) 



3 The treatment here follows that of Polchinski in ref. Q . 
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Fermi surface 




Fig. 2. The momentum p of an excitation above the Fermi surface 
is divided into a component k on the Fermi surface, and a component 
£ perpendicular to the surface. The length of \£\ is the quantity one 
wants to scale. 



where E was the scale of the experiment and A was a physical scale associated with the 
system being probed. In a nonrelativistic system we identify the scaling dimension with 
momentum, in which case energy scales like p 2 . Furthermore, it doesn't make sense to 
expand around p = since an excitation cannot have a momentum vector inside the Fermi 
surface. So we write the momentum as 



where k lies on the Fermi surface and I is perpendicular to it (fig. 2). Then I is the 
quantity we vary in experiments and so we define the dimension of operators by how they 
must scale so that the theory is unchanged when we change I — > ri. If an object scales as 
r n , then we say it has dimension n. Then [k] = 0, [£] = 1, and [j d 3 p = J d 2 kd£] = 1. And 



and so [e — 6f] = 1 and [d t ] = 1. Given that the action ( |3.2|) isn't supposed to change 
under this scaling, 



p = k + £ 



if we define the Fermi velocity as V p e, then for £ ^ k, 



e(p)-e F = £-v F (k)+0(£ 2 ) , 




Now consider an interaction of the form 



int 




dt / n( rf2 ^^05 3 (i 5 tot)C'(A; 1 ,...,A ; 4)^ t bi)V' s b2)^(P3)V' s '(P4) • 
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This will be relevant, marginal or irrelevant depending on the dimension of C. Apparently 
[5 3 (P to t)C] = — 1- So how does the 5 function scale? For generic k vectors, 5(P to t) is a 
constraint on the k vectors that doesn't change much as one changes £, so that [5 3 (Ptot)} = 
0. It follows that [C] = — 1 and that the four fermion interaction is irrelevant... and that 
the system is adequately described in terms of free fermions (with an arbitrarily screwy 
dispersion relation). This effect is known in nuclear physics, where Pauli blocking allows 
a strongly interacting system of nucleons to have single particle excitations. 

It is amusing that when a pair of fcj vectors are within 0(£) of cancelling each other, 
then the scaling dimension of the delta function changes from to —1. To see this, fix set 
the €s to zero, and fix the incoming momenta k± and &2- The 5-function then generically 
constrains three out of the four degrees of freedom in the outgoing momenta ks and k^ 
in terms of k\ + ki- However, if k\ + k<i = 0, then k^ + k^ must equal zero, but that 
only constrains two of the four degrees of freedom (assuming a parity symmetric Fermi 
surface). Therefore the delta function 5 3 (p) must scale like 5 2 (k)5(£), and so for these 
head-on collisions between particles at opposite sides of the Fermi sea, [C] = 0, and the 
interaction is marginal. Quantum corrections either make it either irrelevant of relevant; 
it turns out that for C attractive, the interaction becomes relevant, and if it is repulsive 
it becomes irrelevant. In the former case, the interaction between such quasiparticles 
becomes strong near the Fermi surface, and can lead to pairing and superconductivity. See 
ref. |1| for more about this. 

Exercise 7. How would you couple phonons to the fermions in a Landau liquid? Would 
the phonon - fermion coupling be relevant, irrelevant, or marginal? 



4. Quantum corrections and renormalization 

It is fine to call a higher dimension operator irrelevant when one is computing am- 
plitudes at tree level, and the momenta flowing through the vertices is small. But what 
happens when one calculates quantum corrections (loop graphs) involving these irrelevant 
interactions and integrates over intermediate states of all energies? Do the irrelevant oper- 
ators become important? A field theory with irrelevant operators used to fill field theorists 
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Fig. 3. A divergent one-loop radiative correction to the fermion mass 
and kinetic term in a theory with a (ifjifj) 2 interaction. 



with horror, since they were "nonrenormalizable" . This meant that rather than having a 
finite number of counterterms that had to be fixed by some experimental measurement, 
one needed an infinite number. Such theories were thought to be unpredictive. QED is a 
good example of a renormalizable theory: Only two measurements are needed to fix the 
counterterms, namely a and m e . Once these quantities are measured in one set of exper- 
iments, all other QED processes can be predicted. In a theory with irrelevant operators, 
however, extra insertions of the operator in a graph makes it more divergent. In a theory 
with a Fermi interaction, for example — (V'V') 2 — one finds one needs counterterms for all 
(t^tf;) 271 operators. Furthermore, these operators can in general renormalize relevant oper- 
ators, such as the fermion mass, so it seems that all of these infinite number of interactions 
must be fit to experiment and nothing can be predicted. 

This quandary is avoided if one uses a mass independent renormalization scheme 
(dimensional regularization) , and thinks of the effective theory not as an expansion in 
operators, but as an expansion in inverse powers of some large physical scale A. Let us 
assume that we wish to do experiments at some momentum scale p and that the relevant 
operators have coefficients set by a scale m < p. In contrast, the irrelevant operators have 
coefficients which are inverse powers of A ^> m, p. For example, a theory of a fermion with 
mass m and higher order interactions: 



Now consider the divergent graph in fig. 3. 

This graph gives a divergent contribution to the mass operator tptj; proportional to 



C = ifji0tfj — rmfjifj 



A 2 



a ,— 



W) 2 
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When Wick rotated into Euclidian space and denned by dimensional regularization, the 
above integral equals (see eq. (A.l) in the Appendix) 

m 3 / 1 
16tt 2 A 2 V e 

where we are in 4 — 2e dimensions, and \x is the renormalization scale that creeps into the 
problem^. In a mass independent subtraction scheme we put in a one-loop counterterm 



+ 7 - 1 + In 



9 

m 

4tt / u 2 



that cancels the infinite part of this graph, as well as a mass independent finite part. For 



example, in the MS scheme, we subtract the part proportional to 

am 3 ( 1 



16tt 2 A 2 



V 7 — 1 — ln4-7r 



e 

We are left with a finite contribution to the fermion mass equal to (up to an 0(1) numerical 
factor which I have dropped) 



am 3 



2 

m 



(4.1) 



We choose a convenient scale \i and fit (m+5m) to experiment (once one has also calculated 
the one-loop wave function renormalization). 
The important point I wish to make is that 

5m m 2 
m 167r 2 A 2 ' 

it is small — it needs to be taken into account when probing effects proportional to 
but not otherwise. Note that this would not have been the case if we had simply taken 
A to be a physical momentum cutoff and not renormalized...then, since the fermion loop 
graph is quadratically divergent, we would have found 5m ~ (m/A 2 ) x A 2 ~ m. This 
would be a ludicrous state of affairs — we would have to understand quantum gravity, for 
example, to compute radiative corrections to e + e~ scattering. 

The above example has several important features which I wish to draw your attention 
to: (i) The correction to the electron mass 5m is suppressed by m 2 /A 2 ; (ii) 5m has a 
logarithmic dependence on the fermion mass and the renormalization scale fj,; (iii) The 
corrections to the fermion mass are proportional to the fermion mass. Each of these three 
points is worth commenting on: 



For a discussion of dimensional regularization, see for example refs. ||, @. For those familiar 
with the concepts, some useful formulas are included as an appendix to these lecture notes. 
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4-1. The size of radiative corrections 

Concerning the first point: it is an obvious and general result that in a mass indepen- 
dent subtraction scheme, corrections to low dimension operators due to high dimension 
operators are always suppressed by powers of p/A and m/A. This is not what one would 
finds simply putting A in as a momentum cutoff for one's integrals. It is an obvious result 
because the only new mass scale induced by dimensional regularization is /i, and that can 
be seen to only enter logarithms. Thus an integral with dimension n will be proportional 
to the n th power of the physical scales in the problem p and/or m. The scale A only enters 
the problem raised to negative powers at the vertices. Thus the graph is always propor- 
tional to (p/A) n where n is the combined powers from the vertices. No positive power of 
A is generated by the loop integral. 

The fermion mass in our theory does receive an infinite number of corrections from the 
infinite number of higher dimension operators, and they are only computable if I measure 
all of the coefficients of these operators. However the theory remains predictive, since at 
any finite order in m/A C 1 there are a finite number of contributions to 8m. 

4-2. Radiative logarithms and the scale \x 

The renormalization scale \x enters through logarithms of iijm or \ijp. If we could 
sum up all orders in perturbation theory, all our answers would be \x independent. How- 
ever, we stop at finite order, and our choice of \x can affect how quickly the perturbative 
expansion converges, since higher loop graphs yield higher powers of In// 2 /p 2 . Thus we 
should optimize perturbation theory by choosing \i to minimize the logarithm. When com- 
paring experiments at widely different physical scales, we may run across large logarithms 
then of the form ln(pf/p|) since the same \x cannot make the logs in the two processes 
simultaneously small. These large logs can be resummed using the renormalization group, 
discussed in a later section. 

4-3. Symmetry and naturalness 

I noted that 5m oc m in eq. (]4.1|) . This is because m — > increases the symmetry 
of the theory: in the above example the symmetry ip — > ^51^, ip — > — "075 is a symmetry 
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of the Fermi interaction (and kinetic term), but not the mass term. If m = it follows 
that 5m must also vanish. Therefore it is natural that the fermion mass might be small 
compared to other physical scales in the problem. In contrast, a scalar mass term <jxp* does 
not usually break a symmetry — the only exceptions are if the theory is supersymmetric, 
or if the scalar is a Goldstone boson. The latter is important for pion physics and chiral 
perturbation theory. Even if the tree level scalar mass is zero, it will get radiatively 
corrected by other fields it couples to. Thus it is unnatural for there to be a light scalar 
coupled to high energy fields. Since scalars presumably couple to gravity, typified by the 
Planck scale rap = 10 19 GeV, one has to wonder why the Higgs boson in the standard 
model has a mass in the 10 2 to 10 3 GeV mass range. (It has been suggested that in fact 
either there is no scalar Higgs boson, or that it is a Goldstone boson, or that it is a member 
of a supersymmetric multiplet of particles). 

It is ironic that it used to be that people were worried about theories with irrelevant 
operators being sick. In fact what we see is that irrelevant operators cause no problems; 
it is the relevant operators that we must worry about. If relevant operators appear in the 
effective field theory, then they must be set by a scale much less than A (else they wouldn't 
be in the effective theory below A) . But if their coefficients are much smaller than A without 
a symmetry reason, then we are baffled. The prime example is the cosmological constant, 
namely the dimension 4 coefficient of the operator 1, otherwise known as the vacuum 
energy density. There is no known symmetry that appears relevant to our world that is 
increased by setting the vacuum energy density to zero, yet from cosmological observations, 
the vacuum energy is known to be < 10 -46 GeV 4 0. The smallness of the cosmological 
constant should be taken as a warning: it appears contrary to effective field theory dogma, 
so the dogma may be flawed. 

Exercise 8. Compute both the wavefunction and mass corrections from the graph in fig. 
3, using the MS scheme. See the Appendix for dimensional regularization formulas. 



5. Matching 

Consider doing experiments with photons and electrons entirely within the context 
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of QED. There are three different regimes for scattering experiments which one might 
consider: 

1. Either photons or electrons or both in the incoming state, with momentum transfer 
large compared to m e ; 

2. electrons in the incoming state, but at momentum transfer much smaller than m e ; 

3. photons in the incoming state, but momentum transfer much less than m e . 

In the first case one needs to compute the relevant amplitude in the full QED theory, 
although at high energy one might make the approximation that the electron is massless. 
Furthermore, the fine structure constant has to be adjusted from its low energy value 
a = 1/137, and effect due to quantum corrections which we discuss in a later section. The 
second case is a little funny — we can ignore much of the complexity of QED since we 
do not have enough energy to produce positron-electron pairs, yet we still need to include 
both electrons and photons in the theory; I briefly mention the techniques one uses in this 
case in §7. For the third case one need only consider an effective theory with photons... why 
include electrons if one never sees any? 

The low energy theory of photons alone looks like 

C eff = -\F^ V + -L (a{F^Fn 2 + HF^Pn 2 ) + 0(l/m 8 e ) , 

the most general local, hermitian theory invariant under Lorentz, gauge, charge conjuga- 
tion and parity transformations. (Can you show why there are no irrelevant operators of 
dimension 6?). This is not QED because it distorts high energy physics... but we do care 
that it correctly reproduces low energy phenomenology If we did not know about QED, we 
could treat this as a phenomenological theory and try to fit a and b to measured scattering 
cross sections. However, we do know QED, and so we can compute a and b. To do this we 
simply require that Cqed and £ e // give us the same physical predictions at low energy. In 
general, ensuring that the effective theory agrees in its predictions with the full theory to 
any desired order of accuracy is called "matching" . What we are matching is the value of 
Green's functions in the two theories. Effective field theories are designed to reproduce all 
of the infrared (light particle) physics of the full theory, while distorting the high energy 
behavior to make calculations simpler. All of the interesting infrared effects in the full 
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theory due to light particles are explicitly included; only the effects of the heavy particles 
or high energy modes must be mocked up. So the correct thing to do is match all the "one 
light particle irreducible" (1LPI) diagrams (diagrams that do not fall apart when one light 
particle line is cut) , since these are the graphs that contain either a heavy particle, or high 
energy modes of a light particle. We cannot do this exactly of course, but we can do it 
systematically in a "loop" expansion, which is an expansion in powers of the numbers of 
loops in a diagram, or equivalently, powers of fti. 

5.1. Example: the </> 2 $ interaction. 

Rather than discussing QED, I will consider a toy model that exhibits nicely the 
matching procedure. It is the theory in eq. ( |3.1| ) with a light scalar <fi coupled to a heavy 
scalar $ via the interaction |k^> 2 $. (Never mind that the vacuum energy is unbounded 
below; one won't see this in perturbation theory). Suppose we are interested in 2(f) — > 2(p 
scattering at energies much below the $ mass M. The graphs we have to match to order 
h are those in fig. 4. 

At tree level, $ exchange generates a </> 4 interaction in the effective theory, so we find 

that 



where the number cq is dimensionless and 0{1), and computable from the graphs. The . . . 
refers to operators such as (k 2 /M A )cj) 2 d 2 (f) 2 that one finds expanding the one-$ exchange 
diagram to order p 2 . If I had included a $ 3 interaction in the full theory, there would 
have been more complicated tree diagrams leading to operators with higher powers of cf> 
in £- e ff. The tree level matching condition is shown at the top of fig. 4. The graphs on 
the left are $ exchange graphs in the full theory, while the contact interaction on the right 
is a local operator in the effective theory. For nonrelativistic </> particles, the procedure is 
equivalent to replacing the short range Yukawa potential due to $ exchange with a 5 3 (r) 
potential with a suitably matched coefficient. 

5 It may seem funny expanding in a dimensionful quantity we set to unity! However the loop 
expansion can be seen to be consistent with a perturbative expansion in coupling constants — see 
Coleman's lecture "Secret Symmetries" in ref. ||. 
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Fig. 4- Matching conditions for the theory of eq. (\3. 1\) . Diagrams on 
the left are in the full theory, while those on the right are in the effective 
theory. Heavy lines correspond to the heavy scalar propagator; numbers 
beneath the vertices count the loop order of the matching condition. 
The first row is the complete tree level matching condition; second 
and third rows are the one-loop matching conditions for the two- and 
four-point vertices respectively. Note that matching conditions are not 
simply the contraction of heavy propagators to contact interactions. 

Now consider matching at 0(K). We must consider graphs with both 2 and 4 external 
<p fields. First consider the ones with two external fields. The mass renormalization 
graphs are divergent in both theories and are computed in MS To avoid large logarithms 
in the matching conditions of the form M 2 lnM 2 /^ 2 we choose the renormalization scale 
fj, = M. Then the loop graphs in the second line of fig. 4 are well defined, finite objects, 
and the equation defines the 0(Ti) 4> 2 interactions of the effective theory, labeled by a "1" 
on the right side of the equation. Including these terms, the kinetic term of £ e f f becomes 

K i+a ™) w)2 -K m2+ ^> 2 

where a± and b\ are again dimensionless, 0(1), and computable from the graphs. I have 
explicitly pulled out of the graphs the dimensionful quantities and the factors of 1/167T 2 
that arise from the loop integration. 

Some of the graphs with four external 0's are shown on the third line in fig. 4. 
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With zero external momentum, the graphs are approximately equal to k 4 / (167t 2 M 4 ) times 
logarithms. The logarithms blow up in the limit that the mass m goes to zero (an 
"infrared divergence"). However, the loop graphs in the effective theory have the exact 
same infrared divergence. Therefore the 0(h) contribution to a <^ 4 interaction in the 
effective theory (labelled by a "1" on the last line of fig. 4) does not blow up as m — > 0. 
After 1-loop matching has been performed, the effective theory looks like: 



where the coefficients a, b and c are 0(1). In addition there are higher dimension operaotrs, 
such as </> 6 , (4>d 2 4>) 2 , etc. This Lagrangian can be used to compute 2(f) — > 2(f) scattering 
up to 1 loop. One can perform an ai-dependent rescaling of the <f) field to return to a 
conventionally normalized kinetic term. 

Let me close this section with several comments about the above example: 

• Notice that the loop expansion is equivalent to an expansion in (k 2 /16tt 2 M 2 ). To the 
extent that this is a small number, perturbation theory and the loop expansion makes 
sense. 

• We only computed relevant operators. There are in addition effects that are suppressed 
by powers of E 2 /M 2 in an experiment with energy E (irrelevant operators). These 
may be as important as a subleading correction to a relevant operator's coefficient. 

• We see an example of naturalness: the matching correction to the scalar mass is not 
proportional to m , so that it is "unnatural" for the physical mass to be <C ~ 
that would require a finely tuned conspiracy between m 2 and k 2 . For k and m both 
very small there is a symmetry regained in the full theory, namely the shift symmetry 
(f> — > (f> + constant, which explains why <f) can be naturally light in this limit. 

• The coefficients of operators in the effective field theory are regularization scheme 
dependent. Their values differ for different schemes, but physical predictions do not 
(e.g, the relative cross sections for 2(f> — > 2(f> at two different energies). 

• The coefficients of operators in the effective field theory are ji dependent, where \i is 
the renormalization scale. (More on this below). 




(5.1) 
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• In the matching conditions the graphs in both theories have pieces depending nonan- 
alytically on light particle masses and momenta (eg, In m 2 /M 2 or In p 2 /M 2 )... these 
terms cancel on both sides of the matching condition so that the interactions in C e f / 
have a local expansion in inverse powers of 1/M. This is an important and generic 
property of effective field theories. 



Exercise 9. Compute the graphs in fig. 4, using the MS scheme, and determine the 
coefficients a, b, and c in eq. (|5. 1\). 



Exercise 10. Draw a graph in the full theory that is not 1LPI ("one light particle 
irreducible") and convince yourself that that it is included in the effective theory, provided 
one matches all 1LPI graphs. 



6. Quantum corrections: the myth of marginality 

We have seen that relevant interactions — those with dimension < 4 (or < d in d 
dimensions) - - dominate physics at low energies. Marginal interactions (dimension 4) 
would appear to be equally important at all scales. In fact, quantum corrections change 
the scaling dimension of operators from their classical value. This doesn't usually have 
a dramatic effect on relevant or irrelevant operators, but for marginal operators it means 
that they become either relevant or irrelevant. 

6.1. Renormalization group and 4> 4 theory 

To be concrete, consider <p theory with the Lagrangian 

£ = i^) 2 -im¥-^. (fU) 

Consider the calculation of a the 1PI Green's functions T n , which are one particle irre- 
ducible graphs that have had the external propagators amputated. They can be directly 
related to scattering amplitudes. Ignoring the issues of renormalization, one would expect 
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to express these Green's functions in terms of the external momenta, the particle mass m, 
and the coupling constant A: 

r„(pi, ...,p n ;m, A) . 

The dimension of this objecti is (4 — n) . Therefore if one scales all of the external momenta 
by a factor s, one expects 

T n (sp; m, A) = s^ n T n {p- m/s, A) . (6.2) 

This expresses precisely what I was saying earlier about how the scalar mass is a relevant 
operator — note that its effects become large for small momentum scales, corresponding 
to s < 1. On the other hand, the </> 4 interaction's marginality is the observation that the 
importance of the A coupling is independent of scale. 

This analysis is incorrect when quantum corrections are taken into account, due to 
the introduction of a new scale \x. When we compute in perturbation theory, we must 
include counters and define the renormalized Lagrangian 1 

Cven. — C ~\~ £>ct 

where 

r l/Qi \2 1 2/2 ^0,4 

Cren = jW) - 2 TO 0<?>0 ~ 



A 



C ct = ±A(d<P) 2 - \m 2 Bf - fi 2e ^C<j>* . 

Both £ and C c t must be regulated; here I have chosen dimensional regularization, and a 
factor of /U 2e is inserted to keep A dimensionless, where \i is the arbitrary renormalization 
scale. The Lagrangian £ is written in terms of finite parameters, but gives infinite results; 
C c t gives the counterterms A, B, C which all have 1/e poles in dimensional regularization 
and blow up in the e — » limit. Computing graphs with the sum C ren = C + C ct , which 



6 T n is the time ordered product of n scalar fields (d = n) Fourier transformed to momentum 
space (d = —An) with n external propagators removed (d = 2n) and a factor of 8 4 (ptot) factored 
out (d = 4)... this gives d = 4 — n. 

7 See Ramond's book || for details; also see David Gross' 1975 Les Houches lecture ||. 
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is written in terms of "bare" couplings and fields, yields finite answers. The obvious 
correspondence between bare and renormalized parameters is: 



y/l + A, 



m\ = m 2 (l + B)/Z (j) , A = A(l + C)/Z2 . 



We can treat Ao, mo, \i and e as independent parameters, and express A and m in terms 
of them. 

We can now define either bare or renormalized Green's functions, r° and V respec- 
tively. The relation between the two is 

r°(pi, -,Pn] A ,m ,e) = Z~ n/2 Y n (p u ...,p n ; A m, /i, e) 

where T n is finite as e — > 0. Using the fact that r° is independent of \i, so that dT^/dfx = 0, 
one can derive the renormalization group (RG) equation 



d n d d 







(6.3) 



where (3 = fidX/dfx, 7 m = fxdm/dfx, 7 = \ \id In Z^/dfi. One can compute these functions 
in perturbation theory by relating m, and A to mo, Ao and fx and e. For <fi 4 theory one 
finds to leading nonzero order in perturbation theory 



/3(A) 



3A^ 



A 



lr, 



1 



A 



(6.4) 



16tt 2 ' 16tt 2 ' ' 12 V16tt 2 

The reason why the RG equation is useful is because it tells one what happens if 
one scales the external momenta, given that there is a new scale in the problem, \x. On 
rescaling momenta by s, eq. (|6.2|) must be modified to read 



r n (sp; rn, A, n) = s 4 n T n (p; m/s, A, fi/s) 



or equivalent ly 



d d d tt . 

s— + m- h /i- (4 - n) 

dm d[i 



ds 



T(sp; m, A, y) = 0. 



(6.5) 



(6.6) 



This can be combined with the renormalization group equation (|6.3|) to yield an equation 
which relates the scaling of s to changes in m and A alone, and not 



~ S ~d~s + 9A + ~ dm ~ U1 + ~ n 



T n (sp; m, A, 11) = 0. 



(6.7) 
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Ms) 



log s 



Fig. 5. The solution for the running coupling A(s) as a function 
of ln(s) . The one-loop expression becomes infinite at finite ln(s) = 
167r 2 /3A 2 , £/ie result is not to be trusted since the perturbative 
expansion breaks down. 



If one uses a mass independent subtraction scheme such as MS, then the coefficients 
P, 7 m and 7 depend only on A and not on the other dimensionless quantity, m//x. In this 
case, one can solve eq. (p.7[), and one finds 

r n ( S p; m, A, (j,) = s*- n T n (p; m(s),\(s), fi)e~ n K ^flC-'))/-' ( 6 . 8 ) 

where A and m satisfy the differential equations 

d\(s) 



ds 

drn(s) 



(3(X(s)), A(1) = A 



(j m - l)m(s) , m(l)=m. 



ds 

First look at this solution at tree level, where f3 = 7 m = 7 = and V is independent 
of \i. Then the solution (|6.8| ) is equivalent to the simple scaling property (|6.5| ). If only 7 
is nonzero, and it is constant, then the exponential in eq. ( |6.8[ ) gives an overall factor of 
s~ ni to the scaling of T...the engineering dimension (4 — n) is modified by an additional 
factor of —7 for each of the n fields, hence the name "anomalous dimension" for 7. Finally, 
if (3 and 7 m are nonzero, then changing the momentum scale means one lets the mass and 
coupling "run" . Using the (3 function in eq. ( |Q| ) , one finds 



= — > AGO 



16tt 2 w 1 - (3A/167r 2 )lns ' 

See fig. 5. 

We see that the (j) 4 interaction is an example of a marginal interaction that becomes 
irrelevant due to quantum corrections: the lower the energy scale probed in a scattering 
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experiment, the weaker the effect of the interaction. QED is another example — the gauge 
interaction becomes irrelevant due to quantum corrections. There is a simple physical 
explanation for this: the vacuum acts as a dielectric, with virtual particle-antiparticle 
pairs which screen charges. The greater the impact parameter in a scattering experiment, 
the more screened the charge is and the weaker the interaction. This can be parametrized 
by a scale dependent fine structure constant, a (fx). As fx — > 0, a(fx) — > 0. In QED, the 
screening ceases over distances longer than the Compton wavelength of the electron, and 
so a((i) — > 1/137 for \i < m e . Theories such as QED and 4> 4 all by themselves are called 
"asymptotically unfree" . they are thought to be meaningless as theories because of what 
happens in the ultraviolet: In (J) 4 theory one finds nonperturbatively (ie, on the lattice) 
that A (//) — > oo for /i — > /iq for a finite /xq- QED probably behaves similarly, although 
people debate whether a may approach a constant for sufficiently large \i (a "nontrivial 
fixed-point" ) . 



6.2. Renormalization group and QCD 

In contrast, Yang-Mills theories such as QCD have a negative /3-function and are 
asymptotically free: the gauge interactions, which are marginal at tree level, become rel- 
evant. The important physical difference between QED and Yang-Mills theories that ac- 
counts for the different sign of the /3-function is that Yang-Mills gauge bosons carry charge, 
while photons do not. For QCD, the f3 function at one loop order with Nf flavors of (Dirac) 
quarks is 



dg g 



3 



-11 + 



2N f 



bog 



3 



(6.9) 



3 

For Nf < 16 this is negative, and so it is negative in the standard model where Nf = 6 
(u,d,s,c,b,t). Defining a s = g 2 /Ait, eq. (|6.9|) can be integrated to give 



l/a s (/Lt ) + 47r&o ln(/i//u ) Arcbo ln(/u/AgcD) 
Notice that a new scale has crept into the theory — Aqcd- It has been defined as 

A Q CD = ^V4^oa(Mo) ) (610) 
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Fig. 6. The solution for the running coupling a s (ii) in QCD using the 
one-loop (3 function. The behavior near fx = Aqqd is unreliable where 
a s is large. 



and is independent of (j,q (to the order we are working in perturbation theory). This is 
the scale that determines where the strong interactions get strong, what the proton and p 
masses are, etc. It's value is scheme dependent, and depending on which sensible scheme 
one uses, it can range from 100 — 250 MeV i. See fig. 6 for a plot of a s (n). 

The reason why we call the strong interactions "strong" is because the beta function 
is negative, and there are some light quarks. Even though the the gauge coupling a s 
has positive dimension, it is small when a s is small and the interaction is only barely 
relevant. In contrast, quark masses start off with a classical dimension 1. Assuming for 
the moment that QCD with explicit quark masses was the true theory (ie, ignoring the 
weak interactions and the Higgs), let us look at the theory from the vantage point of the 
Planck scale. We would see a small a s with sluggish logarithmic scaling racing against tiny 
quark masses with linear scaling properties. Which one wins in the infrared? For the top 
quark, the mass term wins — a s (m t ) ~ 0.1 and the toponium (tt) mass is determined by 
2mt with only small {ot 2 s m t ) Coulomb corrections. In contrast, the gauge interaction wins 
in the race against the u and d quarks, which have masses of order 10 MeV. The proton 
- a uud bound state — has a mass equal to 940 MeV, which is scarcely affected by the u 
and d quark masses. Its mass is attributable to the effects of the strong gauge interaction, 
which is associated with a nonperturbative (scheme dependent) mass scale Aqqd ~ 200 



MeV (MS). The strong interactions are strong because gluon interaction is relevant and 



8 One does not determine it by measuring where a s blows up! Instead one determines a s at 
some large scale, such as the Z mass, where QCD is weakly coupled and a perturbative calculation 
of (3 makes sense. Then A is defined (at one-loop order) by eq. ( 6 . 1 0| ) . 
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the u, d and s quarks are light □ 

6.3. The RG and perturbation theory 

The (3 function for marginal interactions can be computed without much problem in 
perturbation theory; similarly one can see how relevant and irrelevant operator dimen- 
sions are modified (anomalous dimensions). My favorite treatment of the subject — called 
renormalization group analysis — is in the book by Ramond ||, §4.5. Unfortunately, 
perturbation theory, with its attendant divergences and counterterms, may be a practical 
computational tool, but it tends to obscure the beautiful physics behind the renormal- 
ization group. Wilson thought of the effective action Sa as a theory with all modes of 
frequency u > A removed. Sa-sa could be defined as 



where the integral is a path integral over all modes with frequency A — 5A < uj < A. 
Since the integration is over a finite number of modes (assume the system is in a box), the 
integration is finite and one needn't ever discuss counterterms, renormalization, etc. The 
action can then be shown to obey a differential equation 



where F is a functional of the action. Couplings in the effective action "flow" as one 
changes the cutoff A. Those with negative eigenvalues are irrelevant, and their coupling 
flows to zero in the infrared (limit of small A); positive eigenvalues correspond to relevant 
operators and their effects become stronger in the infrared. Wilson's picture is in many 

9 It should seem peculiar to the reader that the quark masses are scattered within a couple 
orders of magnitude of Aqcd — this doesn't seem natural from the Planck scale perspective. In 
fact the situation is complicated by the fact that above the weak scale, quarks don't have masses, 
but rather Yukawa couplings to the Higgs. The mystery then becomes, why does the Higgs get an 
expectation value within a couple orders of magnitude of Aqcd? Why do the Yukawa couplings 
range from 10~ 5 (for the u quark) to 1 (for the top quark)? There are a range of explanations to 
these questions in the literature, with a range of plausibility, but there is no evidence for any of 
them presently. 
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ways less confusing than the perturbative renormalization group discussion, but it is not 
practical for analytic computations. 

At any finite order in perturbation theory, one's answers will be \x dependent, and 
one has to pick a scale. Changing n — > // corresponds to changing the coupling constant 
9 - 9'- If 

P = bg 2 

for example, then 

9 = YZTT t = g[l + bgt + (bgt) 2 + . . .} 

where t = ln/i/^/'. So a calculation to second order in #(//) includes an infinite number 
of terms in a perturbative expansion in g(n) Infi/fi' . Scaling g is said to "sum the leading 
logs". Using the two loop (3 function sums terms like (g 2 t) n , the "subheading logs". So 
using different values for \i does change results of a calculation to some finite order in g. In 
practice then one wants to choose the value for \i that makes the perturbation expansion 
converge most quickly. Typically, that means choosing /i to make the logs small, which 
means choosing \x to be a physical scale in the process of interest, eg the q 2 flowing through 
the graph. This is good news if you are doing QCD at q 2 = (100 GeV) 2 , since a s {y) ~ 0.1 
at that scale; it is bad news if you wish to do a QCD calculation at q 2 = (500 MeV) 2 
since there is no perturbative expansion in a s at that low value for /i. To figure out the 
right value of \i one really needs to compute next order corrections and find the \x that 



minimizes them. There are interesting prescriptions for choosing \i in the literature [|TU . 
The procedure for computing photon-photon scattering at q 2 <^ ml should be clear 

now: 

• Match QED to an effective theory without photons, choosing \x ~ m e ; 

• Compute the /3's and 7's of the effective theory; 

• Change \i to the q 2 scale of the process of interest, scaling the parameters of the 
effective theory; 

• Compute the process of interest. 

Of course, the scaling isn't very interesting in low energy QED — the only interactions are 
irrelevant, and the lowest order 4- photon vertex does not run with //; only higher order 
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operators do. Where scaling effects are important is when an interaction is pretty strong 
(eg, scaling effects due to gluons in the perturbative regime), or when the scaling is over 
a huge energy range (eg, the computation of sin 2 9 W from GUT theories. See ref. plj].). 
See lecture notes by A. Manohar from the 1995 Lake Louise Winter Institute for a nice 
discussion of the scaling of AS = 1,2 operators from the weak interactions due to gluons 



Exercise 11. Given that to leading nonzero order the renormalization parameters C and 
in eq. (\6.4) for </> 4 theory (MS ) are C = — g|^2 \ & n d = 1 — \ > show that (3 is 
as given in eq. ffi.4j)- Hint: use the fact that Ao is independent of fx, work consistently to 
a given order in perturbation theory, and only set e —>■ at the end of the calculation). 



Exercise 12. Compute a s at fi= 2 GeV using the one-loop (3 function for QCD, given 
that a s at (i = 90 GeV equals 0.12. For the sake of this calculation, assume that the b 
quark mass is mi, = 4.5 GeV. All the other quarks are lighter than 2 GeV, except the top, 
which is heavier than 90 GeV. Assume there are no other colored particles. 

Exercise 13. Explain how you know that the four photon vertex doesn't run with (jl in 
effective QED below m e , in the MS scheme. 



7. Effective field theory with heavy stable particles 

Previously I mentioned that one might be interested in scattering electrons and pho- 
tons at energies much below the electron mass. One immediately encounters a problem 
in constructing the effective field theory in terms of local operators constructed out of 
the electron and photon fields and their derivatives: an operator such as (el/) 2 e)F 2 /m\ is 
not actually a 1/m^ effect since the time derivatives acting on an electron at rest bring 
powers of m e into the numerator. The solution is similar to what we have always done in 
nonrelativistic physics... ignore the electron rest mass and redefine the electron field to get 
rid of the exp(-imt) phase and assume that the remaining frequency — corresponding to 
the kinetic energy — is much smaller than m. 
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A free, charged, nonrelativistic particle obeys the Schrddinger equation: 

i(d t - ieA )V = -± . 

2m 

in the infinite mass limit, we can ignore the kinetic energy which goes as 1/m and so 
= is the equation of motion. We can restore relativistic covariance by defining the 
four velocity vector, which equals (1,0,0,0) in the rest frame of the particle, and so the 
equation of motion becomes 

VpD^ = , 
and the kinetic term in the Lagrangian is 

C = ^vD^>. (7.1) 

How does one get from the relativistic field theory to one with a kinetic term like 
([7.1|) ? One defines the momentum of the heavy particle to be 

where k <C m. Then for a scalar field, one rewrites <fi as = e~ lTnv ' x ^ v . Then one removes 
the positive frequency component of \& which creates antiparticles, and writes the most 
general Lagrangian in terms of the negative frequency component an d its derivatives. 
The result is a theory which involves an expansion in k/m, assumed to be small. One then 
integrates of v's to restore Lorentz covariance. 



The procedure for fermions is to define |T3[ 



h v = l±l e im^v.x^ j 

where ijj is the heavy fermion field. The (1 + V)/2 projection operator eliminates the 
"small components" of the spinor, which are suppressed by 1/m. In the large mass limit 
processes do not change v, and so one then constructs the effective lagrangian C v out of h v 
and expands in powers of d^/m; then one integrates C v over velocities v. All applications 
of interest have extensive symmetries that limit the form of the higher dimension operators 
that one can write down in the effective theory. 
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This procedure has been used extensively over the past few years to analyze hadrons 
containing a heavy (6 or c) quark by constructing an effective theory in powers of m& 
and m c . Another application has been the interaction of pions with baryons, treating the 
baryons as heavy fields. See jnj for a discussion of heavy quark effective field theory, and 
[nj for the heavy baryon formalism. 



8. Effective field theory for the strong interactions: chiral Lagrangians 

Symmetry is the only reason we know about that can explain why mass hierarchies 
occur. The proton mass is much lighter than the Planck scale, and we can qualitatively 
understand that by noting that m p arises from spontaneous breaking of chiral symmetry. 
Chiral symmetry breaking is a nonperturbative effect which is expected to occur at a scale 
^ e -a/a s ^ gee e q (|6,1Q|) ), where a is some number and a s is the strong coupling at the 
scale /i, which might be the Planck scale. Then the reasonable number a/ a ~ 40 explains 
the observed (enormous) hierarchy. Attempts have been made to similarly explain why 
Mw is so much lighter than the Planck scale, but no convincing theory exists. 

Chiral symmetry is a symmetry that keeps fermions light. Symmetries can also keep 
bosons light, but only if they are spontaneously broken. Then Goldstone's theorem guaran- 
tees that there will be massless Goldstone bosons. If what is broken is only an approximate 
symmetry, then one finds "pseudo Goldstone bosons" which are light but not massless. As 
you have heard in Professor Holstein's lecture, this is the explanation for why the pion is 
much lighter than the rho meson, and that one can construct an effective field theory of 
pseudoscalars and baryons to describe low energy strong interactions. Although the hier- 
archy here is not too large, the chiral effective theory pioneered by Weinberg has had many 
successes. And even though it is easy to probe physics far above its range of validity, our 
theoretical failings mean that we cannot analytically compute the coupling constants of the 
chiral Lagrangian by matching with QCD, but must determine them phenomeno logically. 

I don't have time to say much about chiral Lagrangian calculations, but I do want to 
make a comment about power counting. The pions have two mass scales associated with 
them: their mass = 140 MeV, and their decay constant f v . This is variously defined; 
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I take 

(0\& a \* b ) = iUp> l S ah 

f A a = qi^T a q T a = ^ 
U ^ 93 MeV 
The leading operator in the chiral Lagrangian is 

f 2 

— TrdEdE 1 
4 

where 

E = e 2iir a T a /f _ 

This term is universal; it depends on the scale / and on the symmetry breaking pattern 
5*7(2) x SU(2) -> 5*7(2). It does not depend on any other detail of QCD. In fact, the 
system is so highly constrained by symmetry that one gets the same effective theory from 
QCD, a linear a model, or the NJL model! This makes the chiral Lagrangian one of 
the preeminent examples of how an effective field theory "loses" information about short 
distance physics. However, the pion mass term (TrM 9 £) and higher dimension operators 
(eg, Tr(d£d£t) 2 ) are not universal, and measuring their coefficients tells us (indirectly) 
about QCD dynamics. But what is the mass scale these higher dimension operators are 
being expanded in? if the theory is an expansion in p n / then it isn't of any use in the 
real world. In fact the expansion should probably be in inverse powers of m p or some 
higher scale. A nice power counting scheme was developed by Weinberg and discussed in 
detail by Georgi and Manohar [T(J . They argue that the "natural" scale for the derivative 



expansion in powers of d/A is A < 4nf n . The argument is based on the requirement that 
the coefficient of an operator receive radiative corrections no larger than the tree level value. 
In the real world, it seems that A ~ 47r/ 7r works pretty well, and so chiral perturbation 
theory (ie, exploitation of the derivative expansion) works pretty well for pions, up to ~ 500 
MeV in some channels. Certain features work well for kaons as well. Chiral Lagrangians 
have been applied to nuclear matter for both pion and kaon condensation (eg, refs. JT] 
and nuclear forces ]18 . 
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9. Conclusions: Why effective field theory? 

Effective field theory is a useful tool to be learned. Using effective field theory makes 
computations simpler — one needn't compute features of a complicated field theory that 
rare of no interest in low energy physics; and in conjunction with the renormalization group 
one can simply solve problems that involve disparate scales. To learn how to do this one 
must work through examples. Here are a handful of effective field theory calculations in 
the literature which I think are instructive: 
1. Renormalization group calculation of sin 2 6 W from a theory at 10 14 GeV: ref. ]TT[ . 



2. The matching and renormalization group scaling of AS = 1 operators from the weak 



scale down to the hadronic scale: ref. 19 



3. Computation of the charmonium binding energy in nuclei: ref. 



4. Parity violating operators for nuclear physics in the chiral Lagrangian: ref. |2(| . 

5. Chiral perturbation theory with heavy baryons: ref. ||21|| . 

6. Fitting properties of the A(1405) to experiment at the 1-loop level in chiral perturba- 
tion theory: ref. ||22||. 



7. Chiral perturbation theory for hadrons with a heavy quark: ref. |23[ . 
In addition, there are three other recent reviews I recommend on effective field theories, 
all with quite different content, refs. |1[], 0, and [[12]. 

Effective field theory is much more than useful tool, however — it is a paradigm for 
considering all of physics, illuminating the reason why physics looks "simple" : To a first 
approximation we needn't understand quantum gravity to understand the top quark; we 
needn't know about the top quark to understand the hydrogen atom; the details of atomic 
structure are irrelevant for hydrodynamics; and we needn't understand hydrodynamics to 
compute the orbits of the celestial bodies. Some may hungrily await the final theory of 
everything, but effective field theory allows others of us to take small bites of something 
in the meanwhile. 
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Appendix A. Dimensional Regularization Formulas 



Consider the following integral in re dimensions with a Euclidian metric: 

h = I d n k ' 



(k 2 + a 2 ) r ' 

We may evaluate this making in terms of the V function: 



\s) = [ 
Jo 



oo 

S — 1 —OLX 



n-T(-s) - / d.r .r"- s v 

Then 



1 

n n/2 foe 



oc 



7i = — - / d n fc / dx x 7 "" 1 e" x(fc +a ) 

r(r) 



o 



r(r) y 



= W V" 2r 

r(r) 

Another useful integral is 

h = J d n k 



dx x r - 1 - n/2 e~ xa2 
T(r - re/2) 



A- 



(A; 2 +a 2 ) r ' 

To get this we define 

+ a 2 ) r . 



7i (a) = / d n A; — — !- 
7 (a/c 2 + 



= a-"/ 2 7! 

then by differentiating by a and setting a = 1 we find 

reW 2 a n - 2r + 2 T(r - 1 - re/2) 



2(r-l) r(r-l) 



Finally note that 



(/c 2 + a 2 ) 

-'2 



r 



re 
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Some Properties of V Functions 



T functions have the property F(z + 1) = zT(z), with T(l) = 1. Thus for integers 
n > 1, 

T(n + 1) = n\, n > 1 . 



Also useful is the value 



r(i) = V5F 



The T function is singular for non-positive integer arguments. Near these singularities 
it can be expanded as 



r(-n + e) = 



(-1)' 



n 



^ + iP(n + 1) + O(e) 



where 



In particular, 



V>(n + 1) = 1 + 1 + ... + - -7 , 
z n 

7 = 0.5772 . . . 



r(e-l) = — +7-1 

r(6) = i- 7 



Useful consequences: 



/i 



2e 



m 



d 4 ~ 2e g 1 
(27r) 4 " 2e +m 2 ~ 16tt 2 



— +7- 1 -ln47r + ln(m 2 //U 2 ) (A.l) 



2e 



d 4- 2e? 



1 



1 



(2tt) 4 - 2£ ( 9 2 + m 2 ) 2 16tt 2 



1 



— 7 + In An — ln(m /// ) 



(A.2) 
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